Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration.
Authors
Abstract
Many large and small decisions we make in our daily lives (which ice cream to choose, what research projects to pursue, which partner to marry) require an exploration of alternatives before committing to and exploiting the benefits of a particular choice. Furthermore, many decisions require re-evaluation, and further exploration of alternatives, in the face of changing needs or circumstances. That is, our decisions often depend on a higher-level choice: whether to exploit well-known but possibly suboptimal alternatives or to explore risky but potentially more profitable ones. How adaptive agents choose between exploitation and exploration remains an important and open question that has received relatively limited attention in the behavioural and brain sciences. The choice could depend on a number of factors, including the familiarity of the environment, how quickly the environment is likely to change, and the relative value of exploiting known sources of reward versus the cost of reducing uncertainty through exploration. There is no known generally optimal solution to the exploration versus exploitation problem, and a solution to the general case may indeed not be possible. However, there have been formal analyses of the optimal policy under constrained circumstances. There have also been specific suggestions of how humans and animals may respond to this problem under particular experimental conditions, as well as proposals about the brain mechanisms involved. Here, we provide a brief review of this work, discuss how exploration and exploitation may be mediated in the brain, and highlight some promising future directions for research.
Related resources
Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration
References: This article cites 41 articles, 10 of which can be accessed free: http://rstb.royalsocietypublishing.org/content/362/1481/933.full.html#ref-list-1 Article cited in: http://rstb.royalsocietypublishing.org/content/362/1481/933.full.html#related-url
Exploration-exploitation trade-off features a saltatory search behaviour.
Searching experiments conducted in different virtual environments over a gender-balanced group of people revealed a gender irrelevant scale-free spread of searching activity on large spatio-temporal scales. We have suggested and solved analytically a simple statistical model of the coherent-noise type describing the exploration-exploitation trade-off in humans ('should I stay' or 'should I go')...
I-21: Embryo Relinquishment for Reproduction
Background: The conceptualization of the transfer of embryos between the individuals who created them to one or more recipients for family-building is hotly contested – particularly as regards whether the practice should be most appropriately considered to be “donation” or “adoption”. This paper examines this debate, considering the research carried out on the intentions and decisions of those ...
The Rise of Patient Safety-II: Should We Give Up Hope on Safety-I and Extracting Value From Patient Safety Incidents?; Comment on “False Dawns and New Horizons in Patient Safety Research and Practice”
Who could disagree with the seemingly common-sense reasoning that: “We must learn from the things that go wrong.”? Despite major investments to improve patient safety, relatively few evaluations demonstrate convincing reductions in risk, harm, serious error or death. This disappointing trajectory of improvement from learning from errors or Safety-I as it is sometimes known has led some research...
Dopaminergic Control of the Exploration-Exploitation Trade-Off via the Basal Ganglia
We continuously face the dilemma of choosing between actions that gather new information or actions that exploit existing knowledge. This "exploration-exploitation" trade-off depends on the environment: stability favors exploiting knowledge to maximize gains; volatility favors exploring new options and discovering new outcomes. Here we set out to reconcile recent evidence for dopamine's involve...
Journal title:
- Philosophical transactions of the Royal Society of London. Series B, Biological sciences
Volume 362, Issue 1481
Pages -
Publication date 2007